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5 BACKGROUND OF THE INVENTION 

This invention relates in general to systems for finding profiles of topographical 
features of small dimensions, such as those of a diffracting grating, and in particular to 
such systems using optical spectroscopic techniques. 

As the integration density and speed of microelectronic devices increase, circuit 

10 structures continue to shrink in dimension size and to improve in terms of profile edge 
sharpness. The fabrication of state-of-the-art devices requires a considerable number of 
process steps. It is becoming increasingly important to have an accurate measurement of 
submicron linewidth and quantitative description of the profile of the etched structures on 
a pattern wafer at each process step. Furthermore, there is a growing need for wafer 

15 process monitoring and close-loop control such as focus-exposure control in 

photolithography. 

Spectroscopic diffraction-based techniques are especially well suited for 
microelectronics metrology applications because they are nondestructive, sufficiently 
accurate, repeatable, rapid, simple and inexpensive relative to critical dimension-scanning 

20 electron microscopy. In such diffraction-based analysis techniques, typically a model of 
the profile is first constructed, where the model includes a number of parameters that can 
be varied. One or more diffraction intensity versus wavelength curves are calculated 
based on the model constructed and the curve(s) are compared with measured diffraction 
data from the sample. The parameters are then adjusted until a match is found between 

25 the curve(s) and the measured data. 

The current methods being used include multi-slab models where a number of 
rectangular or trapezoidal slabs are put on top of one another to form a seed profile that is 
an approximation of the profile being measured. The parameters that can be adjusted 
include width and height of the rectangles or width, height and sidewall angle of the 

30 trapezoids. It is found that in the wafer processing processes, a number of very different 
profiles of structures may be encountered. The current methods are inadequate for 
measuring a wide variety of very different profiles in the manufacturing process. A 
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simple increase of the number of slabs to model such variety of profiles requires the 
generation of huge libraries whose size grows exponentially with the number of slabs and 
the associated parameters. Furthermore, different sets of parameters, corresponding to 
different profiles, can produce indistinguishable spectroscopic data, resulting in a 

5 problem known as cross-correlation. 

In U.S. Patent No. 5,963,329, Conrad et al. proposed an improved method to 
measure actual profiles. In this model, the number of independent parameters or 
variables is reduced by adopting particular profile shapes such as a "S" line profile, by 
dividing the model line profile into two or more sub-profiles and providing a numerical 
10 model of each sub-profile so that fewer scaling factors may be used to adjust all slab 
widths and heights within the single sub-profile. 

While the above-described method of Conrad et al. reduces the number of 
parameters that one needs to contend with, this method still has some drawbacks. Thus, it 
cannot be used for measuring line profiles made of more than material, and for measuring 
15 optical parameters as well as geometric parameters. It is therefore desirable to provide an 
improved model that can be used for determining the above mentioned samples in a 
manner so that the solution converges to a single solution without a high risk of cross- 
correlation. 

As noted above, the shapes of line profiles encountered on semiconductor wafers 
20 during fabrication can take on a wide variety of shapes. Such line profiles are typically 
situated on and/or below layers of materials which may be the same as or different from 
the material of the profiles. When diffraction-based spectroscopic techniques are used to 
measure such profiles, the radiation used in the technique would interact with the one or 
more layers and transmitted or reflected radiation from the layers is detected by the 
25 detectors that are used for detecting radiation from the line profile. Where it is not 
possible or very difficult to separate the contribution of the signal due to the layers from 
the contribution of the signal due to the line profile, it is desirable for any technique used 
to measure the parameters of such layers simultaneously with measurement of the line 
profile. None of the existing techniques has such capability. It is therefore desirable to 
30 provide an improved system where the contribution of such layers to the detector signal 
can be taken into account. 
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SUMMARY OF THE INVENTION 

Semiconductor devices are fabricated by processing equipment with certain set 
parameters of the manufacturing process, such as the time, temperature, focus and 
exposure dose in the lithography and other parameters, such as the time and temperature 
5 for the deposition of certain layers, or the time, and nature of etching processes. Once 
these parameters are known, it is possible to simulate the profile of the structures that will 
result from such manufacturing process. A gallery of seed profiles or profile types may 
be used as possible starting points for finding the actual shapes of line profiles. 
Preferably, knowledge of manufacturing process parameters may be utilized in the 
10 construction of a gallery of profile types from which a particular profile type can be 
chosen for matching with the measured data. Also preferably, knowledge of 
manufacturing process parameters is utilized to select from the gallery a particular profile 
type that would serve as the best seed profile for the purpose of finding the actual profile 
of structures.. 

\ 5 As noted above, the diffracting structure to be measured is frequently located on 

and/or below one or more layers of the same or different material, so that the detector 
employed would detect radiation influenced by such layers as well as diffraction from the 
diffracting structure. These layers would have to be taken into account in the model. 
Parameters such as thickness and index of refraction (n and k) of these layers would be 
20 more sensitive to certain measurement parameters than others. This is also true of the 
parameters characterizing the diffracting structure. Therefore, in another embodiment of 
the invention, more than one set of radiation data may be generated from each profile 
type, where the sets of radiation data generated are of different radiation parameters, such 
as reflectance or transmittance parameters and ellipsometric parameters. For a given 
25 change in the parameter of the profile type (e.g., width, height, sidewall angle, index of 
refraction of the diffracting structure and thickness and index of refraction of the one or 
more layers) may be more sensitive to the ellipsometric parameters than to the 
transmittance or reflectance parameters, or vice versa. In such event, it may be desirable 
to choose the set of radiation data and the associated radiation parameters that are more 
30 sensitive to a change in the parameter of the profile or a characteristic of the one or more 
layers to improve the accuracy and precision of the modeling and matching algorithm. 
This feature can also be used where the effects of the layers need not be taken into 
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account, such as where the effects are known, can be ignored or where there is no layer 
associated with the structure. 

Independent of the above considerations, reflectance or transmittance parameters 
and ellipsometric parameters of the collected radiation may be used together for deriving 
5 one or more parameters of a profile with arbitrary shape. 

The gallery of profile types may be stored in a database made available to users 
and an optional processor may be used to select the profile type from the gallery and 
compare the detected measured data to that associated with the selected profile type to 
arrive at a set of values of the one or more parameters of the profile type. 
10 Where the profiles measured are useful for controlling a wafer manufacturing 

,^ process, the measured information may be used to control the processing system for 

C= adjusting one or more processing parameters. Thus, if the profile of the structure 

if measured indicates a problem in the processing system, the processing system may be 

adjusted to reduce or eliminate the effects of the problem. Any one of the above- 
Ql 15 described techniques may be used to find a profile of a structure and/or characteristics of 

1 " one or more layers in the vicinity of the structure, and these values may then be supplied 

^ to a semiconductor wafer processing machine, such as a track, stepper and/or etcher, to 

H control the lithographic and/or etching process in order to compensate for any errors in 

fi one or more parameters of the profile that has been discovered. The track, stepper and/or 

20 etcher may form a single tool with a system for finding the one or more parameters of a 
profile, or may be instruments separate from it. 

While the above-described features may be implemented as a stand-alone system 
and integrated with optical equipment for carrying out the measurements, it is possible for 
existing optical measurement equipment to be modified or otherwise enabled so that it 
25 has the capability described above. Thus, the above-described features may be embodied 
as a program of instructions executable by computer to perform the above-described 
different aspects of the invention. Hence any of the techniques described above may be 
performed by means of software components loaded into a computer or any other 
information appliance or digital device. When so enabled, the computer, appliance or 
30 device may then perform the above-described techniques to assist the finding of value(s) 
of the one or more parameters using measured data from a diffracting structure and/or the 
associated one or more layers. The software component may be loaded from a fixed 
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media or accessed through a communication medium such as the internet or any other 
type of computer network. 

Each of the inventive features described above may be used individually or in 
combination in different arrangements. All such combinations and variations are within 
the scope of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1A is a schematic view of a spectroscopic measurement device useful for 
illustrating the invention. 

Fig. IB is a cross-sectional view of a two-dimensional grating and associated 
layers useful for illustrating the invention. 

Fig. 2 is a schematic view of another spectroscopic measurement device useful for 
illustrating the invention. 

Figs. 3A ? 3B, 3C are cross-sectional views of two-dimensional structures 
encountered in semiconductor manufacturing useful for illustrating the invention. 

Fig. 3D is a perspective view of a three dimensional periodic structure with via 
holes useful for illustrating the invention. 

Figs. 4A-4F are sample profiles to illustrate a gallery of profile types or models to 
illustrate an embodiment of the invention. 

Fig. 5A is a flow chart of profile and film measurement to illustrate an 
embodiment of the invention. 

Fig. 5B is a flow chart illustrating in more detail the diffraction solver in the flow 
chart of Fig. 5 A. 

Fig. 6A is a flow chart illustrating the selection of the optimum profile type or 
model and the value of parameters for the initial seat values. 

Fig. 6B is a flow chart illustrating the process for selecting the optimal radiation 
parameter and the corresponding set of radiation data for matching with measured data to 
illustrate one aspect of the invention. 

Fig. 6C is a schematic diagram illustrating the selection of the starting point for 
nonlinear optimization from a course library to illustrate an aspect of the invention. 

Fig. 7 is a schematic block diagram illustrating a wafer processing apparatus 

including a track/stepper and an etcher and a spectroscopic measurement device where 

information from a diffracting structure and/or associated structures from the device as 
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used to control the manufacturing process and the track, stepper and/or etcher to illustrate 
the invention. 

Fig. 8 is a schematic block diagram illustrating in more detail the track/stepper of 

Fig. 7. 

Fig. 9 is a block diagram showing a representative sample logic device in which 
aspects of the present invention may be embodied. 

For simplicity and description, identical components are labeled by the same 
numerals in this application. 

TYHTATT K D DESCRIPTION OF THE EMBODIMENTS 

Even though much of the description below of algorithms and methods are 
described in terms of the reflected or transmitted intensities of the diffraction caused by 
the diffracting structure, it will be understood that the same techniques and algorithms 
may be used for data containing information concerning changes in the polarization state 
over different wavelengths (e.g. ellipsometric parameters A and *¥ as functions of 
wavelength). For this reason, it may be advantageous to employ an instrument which is 
capable of measuring both the reflected or transmitted intensities of the diffraction caused 
by the structure as well as changes in polarization state caused by the diffraction of the 
structure. A suitable system is described below in reference to Fig. 1 A. 

Fig. 1 A is a schematic view of a spectroscopic diffraction-based metrology system 
to illustrate the preferred embodiment of the invention. As shown in Fig. 1A, system 10 
may be used to measure reflected or transmitted intensities or changes in polarization 
states of the diffraction. As shown in Fig. 1A, a semiconductor wafer 11 may comprise a 
silicon substrate 12, and a structure 16 thereon that may include a photoresist pattern on 
and/or over film stack(s), where the film(s) are at least partially light-transmissive and has 
a certain film thickness and refractive index (n and k, the real and imaginary components 
of the index). 

An XYZ stage 14 is used for moving the wafer in the horizontal XY directions. 
Stage 14 may also be used to adjust the z height of the wafer 11. A polychromatic or 
broadband radiation source such as white light source 22 supplies light through a fiber 

optic cable 24 which randomizes the polarization and creates a uniform light source for 
illuminating the wafer. Preferably, source 22 supplies electromagnetic radiation having 

wavelengths in the range of at least 1 80 to 800 nm. Upon emerging from fiber 24, the 
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radiation passes through an optical illuminator 26 that may include an aperture and a 
focusing lens or mirror (not shown). The aperture causes the emerging light beam to 
illuminate an area of structure 16. The light emerging from illuminator 26 is polarized by 
a polarizer 28 to produce a polarized sampling beam 30 illuminating the structure 16. 
5 The radiation originating from sampling beam 30 is reflected by structure 16, 

passed through an analyzer 32 and to a spectrometer 34 to detect different spectral 
components of the reflected radiation, such as those in the spectrum of the radiation 
source 22, to obtain a signature of the structure. In one mode (spectrophotometry mode) 
of operation, the reflected intensities are then used in a manner described below to find 

10 the value(s) of one or more parameters of structure 16. The system 10 can also be 
modified by placing the spectrometer 34 on the side of structure 16 opposite to 
illumination beam 30 to measure the intensities of radiation transmitted through 
structure 16 instead for the same purpose. These reflected or transmitted intensity 
components are supplied to computer 40. 

15 Alternatively, the light reflected by the structure 16 is collected by lens 54, and 

passes through the beam splitter 52 to a spectrometer 60. The spectral components at 
different wavelengths measured are detected and signals representing such components 
are supplied to computer 40. The light reflected by structure 16 may be supplied by 
source 22 through illuminator 26 as described above or through other optical components 

20 in another arrangement. Thus, in such arrangement, lens 23 collects and directs radiation 
from source 22 to a beam splitter 52, which reflects part of the incoming beam towards 
the focus lens 54 which focuses the radiation to structure 16. The light reflected by the 
structure 16 is collected by lens 54, passes through the beam splitter 52 to 
spectrometer 60. 

25 When the system 10 is operated in another mode (spectroscopic ellipsometry 

mode) used to measure the changes in polarization state caused by the diffraction by the 
structure, either the polarizer 28 or the analyzer 32 is rotated (to cause relative rotational 
motion between the polarizer and the analyzer) when spectrometer 34 is detecting the 
diffracted radiation from structure 16 at a plurality of wavelengths, such as those in the 

30 spectrum of the radiation source 22, where the rotation is controlled (not shown) by 

computer 40 in a manner known to those skilled in the art. The diffracted intensities at 

different wavelengths detected are supplied to computer 40, which derives the changes in 

polarization state data at different wavelengths from the intensities in a manner known to 
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11 

705303 vl 

those in the art. See for example U.S. patent No. 5,608,526, which is incorporated herein 
by reference. 

Fig. IB is a cross-sectional view of the structure 16 on substrate 12, which 
structure comprises a diffracting structure 16b situated between the film stack 16a above 
5 the structure and the film stack 16c underneath the structure and an incident 
electromagnetic beam 30 to illustrate the invention. Thus, the incident beam 30 of the 
electromagnetic radiation first encounters the interface between the air and the film 
stack 16a and interfaces that may be present within the stack. Next, the portion of the 
radiation from beam 30 that penetrates the film stack 16a is diffracted by the grating 
10 structure 16b, At least some of the radiation from beam 30 will reach the film stack 16c 
underneath the grating and be reflected by or transmitted through interfaces associated 
4i with stack 16c. The total light reflectance is affected both by the grating and by the film 

stacks above and/or below the grating. Multi-layer interference, caused by multiple 
reflections between the films and the grating, creates a complicated pattern in a 
01 15 reflectance spectrum, which can be used for measuring parameters of the structure. A 

I * part of radiation from beam 30 that is not reflected or diffracted as described above will 

be transmitted into the substrate 12, As shown in Fig. IB, the grating 16b has a height of 
H H, a critical dimension ("CD") and a sidewall angle (SWA) as indicated. 

~Z Fig. 2 is a schematic view of an alternative spectroscopic measurement system 80 

O 20 to illustrate the invention. The system of Fig. 2 differs from that in Fig. 1 A in that it uses 

the same optical components for both the spectrophotometry mode measurement as well 
as the ellipsometry measurement, and thus has fewer optical components. On the other 
hand, the two modes need to be employed sequentially and not simultaneously as is 
possible with the apparatus of Fig. 1A. As before, where there is relative rotational 
25 motion between the polarizer 28 and analyzer 32 when a measurement is taken, the 
system 80 of Fig. 2 operates as an ellipsometer. This can be achieved by rotating either 
the polarizer 28 or the analyzer 32, or both. Where there is no relative rotation between 
polarizer 28 and analyzer 32 (such as where both did not rotate, or rotate at the same 
speed), instrument 80 operates as a spectrophotometer or reflectometer. 
30 As shown in Fig. 2, system 80 further includes a beam divider 82 which diverts 

the portion of the illumination beam from source 22 to a spectrometer 84 which measures 
variations in the intensity of the illumination beam so that the effects of such variations 
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may be removed from the measurements. Beam shaping optics 86 is employed to shape 
the illumination beam, such as by collimating or focusing the beam. 

While some diffracting structures may take on simple geometric shapes such as 
that illustrated in Fig. IB, in some instances, these structures can take on more complex 
shapes. When this is the case, it is desirable to provide a model by which a much wider 
variety of profiles of structures can be predicted than can conventional models. 
Figs. 3A-3D illustrate the type of structures that may be encountered during the wafer 
manufacturing process. Fig. 3 A is a cross-sectional view of a line grating on top of a film 
stack, where the cross-section of each line is in the shape of a trapezoid 92, and the film 
stack comprises layers 94a (bottom anti-reflection coating, or BARC), 94b (polysilicon), 
94c (silicon dioxide) on top of a substrate 12. 

Alternatively, the structure may comprise periodic lines where each line 
comprises a stack of several different materials, where the cross-sectional shape of the 
lines is curved. As illustrated in Fig. 3B., the diffracting structure comprises three layers: 
96a, 96b, 96c and the diffracting structure is located on top of a film 94 which may 
comprise one or more layers. The structure in Fig. 3B typically results from the process 
of shallow trench isolation ("STI"). Yet another example of a realistic structure 
encountered in wafer manufacturing is illustrated in Fig. 3C which comprises a line 
grating with sidewall spacers, made of a material different form that of the line grating. 
As shown in Fig. 3C, each line grating comprises a center portion 98a which is 
substantially rectangular in cross-section and two sidewalls 98a, 98b on the two sides of 
the rectangle where the line structures are situated on top of a film 94. The sidewall 
spacers of Fig. 3C are typically used to control the desired shape of polysilicon lines 98 in 
the process of reactive ion etching ("RDE")- 

Fig> 3D is a perspective view of a periodic structure with via holes where the 
holes may penetrate one or more layers. The via holes provide vertical connections from 
one metallization layer to another. Thus, the structure 16 of Figs. 1 A, 2 may include one 
or more of the diffracting structures and layers shown in Fig. IB, 3A-3D. From the 
shapes of the structures illustrated in Figs. 3A-3D, it will be evident that prior art 
methods, such as the one described in U.S. Patent 5,963,329, may be inadequate for 
measuring the more complex structures illustrated in such figures. 

Fig. 4A-4F illustrate examples of profile models which may advantageously serve 

as the seed profiles or profile types that may be employed to derive the actual profile of a 
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diffraction structure encountered in semiconductor manufacturing. Fig. 4A illustrates a 
cross-sectional view of a profile type comprising a single-material, multi-trapezoid 
profile, characterized by values of CD, height and sidewall angle for each trapezoid. 
When the sidewall angles are fixed at 90 degrees, this profile type becomes a multi-slab 
model. The bottom trapezoid models a footer. 

Fig. 4B is a cross-sectional view of a single-material, quartic profile which may 
be represented by the polynomial expression y = ax 4 , characterized by the height of the 
profile and coefficient value a. Fig. 4C is a cross-sectional view of a single-material, 
quartic profile with a bottom rounding (i.e., rounded footer), characterized by height, 
quartic coefficient a and parameters of the bottom rounding or footer. More than one 
model may be used for the footer, one example being a model using a smooth function 
(e.g. straight line as shown in Fig. 4A, or curved line such as that of a quadratic function). 
Fig. 4D is a cross-sectional view of a multi-material, etched quartic profile of the form 
y = ax 4 , characterized by coefficient a and the thicknesses of each of the three layers. 
Fig. 4E is a cross-sectional view of a two-material profile with sidewall spacers, 
characterized by height, edge and profiling parameters for the inner and outer materials of 
the spacers, such as common height value for the inner and outer materials, different CD 
values for the inner and outer materials, and a sidewall angle for the outer material. 
Fig. 4F is a perspective view of a three-dimensional structure with the via hole profile in 
a uniform layer, characterized by the height and hole parameters (radius for a circular 
hole). 

While the quartic profile is illustrated in Figs. 4B, 4C and 4D, profiles that can be 
described using other polynomial expressions may also be used and are within the scope 
of the invention, such as quadratic parabolas, or a combination of quartic and quadratic 
parabolas. In the same vein, while the profile type in Fig. 4A includes multiple slabs that 
are trapezoidal, slabs defined by one or more analytical functions, such as where one side 
of the trapezoid is curved, may be employed and are within the scope of the invention. 

The profile types in Figs. 4A-4F do not include layers of material which may lie 

above and/or below the actual diffracting structures measured. These layers can also be 

modeled as described below using parameters for such layers, such as thicknesses and 

indices of refraction (referred to herein also as film parameters), so that the models 

constructed using the profile types can take into account the layers above and/or below 

the diffracting structures measured. In addition, the profile types themselves may include 
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layers, such as those illustrated in Fig. 4D. The layers of the profile type in Fig. 4A can 
be modeled using not only geometric parameters, such as the coefficient a and height of 
each of the three layers, but also the complex index of refraction of each of the materials 
in the three layers of the profile itself. 
5 Before any measurement of structure 16 is made using the apparatuses in Figs. 1A 

and 2, a gallery of profile types such as those illustrated in Figs. 4A-4F is first prepared 
and stored in the database. Fig. 5A is a flowchart of profile and film measurement to 
illustrate a process using a model to measure the parameters of the diffracting structure. 
Where the structure is situated on and/or below one or more layers of materials, the 
10 model may also be used to measure one or more parameters of such layers. As shown in 
Fig. 5A, an off-line pre-processing tool 102 is used to provide the gallery described above 
together with the seed profile and film parameters associated with each of the profile 
types, such as the profile types illustrated in Figs. 4A-4F, together with layers over and 

below the profiles shown. The profile parameters can include, for example, CD, height, 
15 sidewall angle, parameters associated with polynomial expressions such as the 

coefficient a and height of quartic profiles, parameters of the bottom rounding and of the 

spacers, and the indices of refraction (n and k) parameters of materials of the line profile. 

The film parameters may include thicknesses of the layers and the indices of refraction 

(n and k). 

20 Tool 102 then computes from the profile types and their associated profile and 

film parameters, as well as initial values of such parameters (e.g. based on estimation, or 
the knowledge or simulation of the fabrication process), predicted spectra radiation data 
associated therewith in a diffraction solver 108. The operation of the diffraction 
solver 108 is illustrated in more detail in Fig. 5B. As shown in Fig. 5B, the profile type 

25 may be approximated by slabs (block 110). Eignvalues and S-matrices for each slab and 
each film underneath and/or over the profile type are computed (block 112). S-matrices 
are then propagated (block 114) to arrive at a spectrum (block 116), which is the 
predicted radiation data when a profile is measured using the instruments of Figs. 1 A, 2. 
For a detailed description of the modeling process applied by solver 108, please see the 

30 references below: 

M. G. Moharam, E. B. Grann, D. A. Pommet, and T. K. Gaylord, 
"Formulation of stable and efficient implementation of the rigorous 
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coupled-wave analysis of binary gratings," J. Opt. Soc. Am. A, voL 12, 
pp. 1068-1076 (1995); 

L. Li, "Formulation and comparison of two recursive matrix 
algorithms for modeling layered diffraction gratings," J. Opt. Soc. Am. A, 
5 vol. 13, pp. 1024-1035 (1996); and 

M. G. Moharam, "Coupled-wave analysis of Two-Dimensional 
Dielectric Gratings," PROC. SPIE, vol. 883, pp. 8-11 (1988). 
Returning now to Fig. 5 A, the spectra associated with the actual diffracting 
structure and the film(s) in structure 16 are then measured using the apparatus of either 
10 Fig. 1A or Fig. 2 (block 120) or any other suitable apparatus and the measured data is 
then compared with the predicted spectrum from the diffraction solver 108 (block 122). 
m If there is a good match between the two spectra, the initial values of the parameters of 

the profile type and of the film(s) then correctly predict those of the actual structure and 
film(s) that are measured (block 124). If the match is less than satisfactory (block 126), 
01 15 the profile and film parameters (block 106) are then varied or adjusted by means of a 

J r nonlinear optimization tool (block 126) in a feed back path. The steps of the diffraction 

solver 108 and the comparison 122 are repeated until there is a satisfactory match 
M between the predicted spectrum and the experimental spectrum. Any number of 

fi nonlinear optimization tools may be employed, such as those described in the following 

O 20 articles: 

J. Nocedal and S.J. Wright, "Numerical Optimization," 
Springer- Verlag, New York, NY (1999); and 

D.T. Pham and D. Karaboga, "Intelligent Optimization 
Techniques: Genetic Algorithms, Tabu Search, Simulated 
25 Annealing and Neural Networks," Springer- Verlag, New York, NY 

(2000). 

As described above in reference to Figs. 3A-3D, the actual diffracting structures 
encountered in wafer processing include a wide variety of different shapes. According to 
one aspect of the invention, information concerning the manufacturing process may be 
30 advantageously used in selecting profile types for the gallery which serve as the seed 
profiles for the modeling process. Thus, the gallery of Figs. 4A-4F are selected keeping 
in mind the structures encountered in semiconductor manufacturing, such as those in 
Figs. 3A-3D. 

-12- 



705303 vl 



As noted above, semiconductor devices are fabricated by processing equipment 
with certain set parameters of the manufacturing process, such as the time, temperature, 
focus and exposure dose in the lithography and other parameters for deposition of certain 
layers or of etching processes. Once these parameters are known, it is possible to derive 
the profile of the structures that will result from such manufacturing process. A software 
tool that may be used to simulate the profile of the structures resulting from the 
manufacturing process is PROLITH™ simulator software, available from KLA-Tencor 
Corporation, the assignee of the present application, in San Jose, California. This 
software is described in Inside PROLITH, by Chris A. Mack, Finle Technologies (Austin, 
TX: 1997). Another possible tool that may be used to simulate the profile of the 
structures resulting from the manufacturing process is Solid_C, from Sigma_C, Munich, 
Germany. Thus, once information concerning the manufacturing process, such as the 
values of the manufacturing process parameters (e.g., time, temperature) is available, the 
profile that is predicted from such parameters may then be used to select a profile type 
from the gallery of profile types to serve as the seed profile for the modeling process 
illustrated in Figs. 5 A, 5B. In addition, the predicted profile arrived at using 
manufacturing process information may also be used to select a set of initial values of the 
parameters associated with the profile type, such as initial values of CD, sidewall angle, 
height, coefficient a and height of quadric expressions or coefficients of other 
polynomial-type expressions, and a process window in which these parameters are 
expected to vary. This process is illustrated in Fig. 6A. 

As illustrated in Fig. 6A, a lithography simulator 240 (e.g. PROLITH™ 
simulator) simulates, from parameters of manufacturing process 242, a line profile 244. 
From the simulated line profile 244, the profile type of Fig, 4A in the gallery that is the 
closest match to line profile 244 is then selected as the seed profile. The line profile 244 
is also used to select initial values of the different parameters of such profile type, so that 
the predicted profile using such profile type is the closest match to simulated profile 244. 
Thus, in the example in Fig. 6A 7 initial values of the seven parameters CDi, CD2, CD3, 
CD4 and Hi, H2, H3 are selected for the modeling process of Figs. 5 A, 5B so that the 
predicted profile 246 shown in Fig. 6A is the closest match to simulated profile 244. By 
making use of the manufacturing process as described above, a profile type with initial 
parameter values that is close to the actual structure being measured is selected as the 
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seed profile for the modeling process, so that the non-linear optimization process 
illustrated in Fig. 5 A can converge rapidly. 

The above-described modeling process starting with the seed profile or profile 
type and with the initial parameter values is illustrated in Fig. 6B. As noted above, from 

5 information available from the manufacturing process, it is possible to ascertain a process 
window in which parameter values may vary. Such window is illustrated in Fig. 6B, 
where the ranges of the parameters 1 and 2 shown are the ones through which these two 
parameters may vary. The window may be defined with respect to a center point 250, 
and an amount that each parameter is allowed to deviate from the value at this center 

10 point; the center point and the deviation allowed for each parameter are derived from the 
manufacturing information. The process window may be divided into different sections 
by a set of vertical lines and a set of horizontal lines, where the intersections between the 
two sets of lines form a set of points, each of which correspond to a pair of values for the 
two parameters. Solver 108 may be used to derive the radiation spectra corresponding to 

15 these pairs of values, where the spectra and their corresponding pairs form a coarse 
library. Where the profile type is characterized by more than two parameters, the window 
would be a space with more than two variables, and each intersection point would 
correspond to a set of more than two parameters. 

After this coarse library has been constructed, the spectra in the library are 

20 matched with the simulated data to find the closest match. The intersection point 252 
corresponding to the closest matching spectrum indicates the set of initial parameter 
values that is a good starting point to perform the optimization process of Fig. 5A. The 
line 254 in Fig. 6B illustrates schematically the path taken by the optimization process, 
arriving at the final result at point 260 in Fig. 6B. 

25 In the profile type of Fig. 4A, for example, there will be at least three parameter 

values: CD, height ("H") and sidewall angle ("SWA"). For some profile types, two 
parameters may be adequate, such as the quartic profile type of Fig. 4B, which may be 
characterized by the coefficient a and height of the profile. As noted above, the set of 
initial parameter values of the profile type selected is such that the predicted profile using 

30 the profile type from the gallery is the closest match to the simulated profile. Thus, as 

noted above in reference to Fig. 5B, a spectrum or spectra of a radiation parameter over a 

range of wavelengths 116 is arrived at using the diffraction solver 108 which corresponds 

to the set of initial values 252 of the profile type selected and its associated films. This 
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spectrum or spectra are then compared with the measured data as in block 122 of Fig. 5 A 
and a non-linear optimization tool may be utilized as described to arrive after 
convergence, along path 254, at a final set 260 of parameter values of the profile type. If 
the profile type of Fig. 4A is selected, for example, the final set 260 would comprise the 
final values of the CD, height and sidewall angle of each trapezoid. 

In order to speed up the process described in reference to Fig. 5 A, a coarse library 
such as that indicated in Fig. 6B may be pre-computed off-line, so that each profile type 
in the gallery is stored together with a number of sets of initial parameter values, such as 
those corresponding to the intersection points (e.g., 252) in the grid-like structure in Fig. 
6B, and their corresponding spectra. The diffraction solver 108 is then used to compute 
the spectrum corresponding to each of the intersection points and such spectra are stored 
together with the profile type and the associated sets of initial parameter values at the 
intersection points. Then, when a simulated profile becomes available, such as simulated 
profile 244 in Fig. 6A, such simulated profile is then matched against the predicted 
profiles that correspond to the different sets of initial parameter values corresponding to 
the intersection points in Fig. 6B in the coarse library. From this comparison, a particular 
intersection point in the grid-like structure and the corresponding set of initial parameter 
values may be quickly identified and the process of blocks 120, 122, 124 and 126 of Fig. 
5A may be carried out very quickly to locate the final set 260 of parameter values of the 
profile type. Thus, while the resolution of the coarse library of Fig. 6B is not sufficient 
for measurement, it provides significant acceleration of non-linear optimization. Where a 
coarse library is not constructed before hand, the center point 250, its corresponding set 
of parameter values and spectra, may be used as the starting point for the optimization 
process in Fig. 5A. 

The above-described radiation parameters may be measured in a manner known to 

those in the art, using the systems in Figs. 1A and 2. Another aspect of the invention is 

based on the observation that certain radiation parameters may be more sensitive to the 

change in one or more parameters associated with the profile type and related films than 

other radiation parameters. This is illustrated in Fig. 6C. From the profile type, its 

associated film(s) and the initial values 252 of the parameters selected, the diffraction 

solver 108 generates predicted spectra of different radiation parameters. Shown as 

examples in Fig. 6C are the spectra 270 for four different radiation parameters that are so 

generated: R s , R p , cosA and tan\|/. The different parameter values (e.g. CD, H, SWA) 
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associated with the profile type are then varied and the diffraction solver 108 is used to 
generate a set of different spectra for each of the four or more different radiation 
parameters. By comparing the change in spectra of the four or more radiation parameters 
corresponding to the same variation in parameter value (e.g. CD, H, SWA), the radiation 
parameter and its corresponding spectra that is the most sensitive to the change in 
parameter value is then identified. 

In other words, each of the selected profile types is varied. Thus, if the profile 
type of Fig. 4A is selected, then each of the parameters CD, H and SWA is varied. For 
each variation of each of the three parameters, diffraction solver 108 computes the 
corresponding spectrum for each of the four or more radiation parameters. A quantity x 2 
may be defined by the equation: 



This quantity (% 2 ) measures the difference between two sets of data R 1 and R 2 , which 
can be, e.g., the theoretical and the experimental values of a certain signal 



defined by the experimental uncertainty. In calculating % 2 in Fig.6C, actually two 

theoretical spectra are compared- one at the initial parameter values with the one at a 
modified parameter values. Quantities other than x 2 may also be used in optimization — 
for example, the cross-correlation between the compared spectra may be optimized. 

The quantity % 2 , along path 254,is thus computed according to the equation above, 
which is the difference between two theoretical spectra— one at the initial parameter 
values 252 and the one where one of the parameter values has been modified from its 
initial value. In the four sets of spectra 270 shown in Fig. 6C, a number of curves are 
computed for each of the four radiation parameters, where each curve corresponds to the 
theoretical spectrum with one of the parameters having a value that is modified compared 
to the initial value. The four quantities % z of the four radiation parameters corresponding 
to the same modification in CD, H or SWA are then compared to identify the radiation 
parameter that is the most sensitive to a change in CD, H or SWA, and its spectra. In the 
example 274 shown in Fig. 6C, x 2 is the largest for the radiation parameter tan\|/. 




(R s ,R p ,cosA,...). The values <7„set the weight of the n-th data point and are typically 
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Therefore, if the radiation parameter tan\|/ is chosen for the modeling process shown in 
Fig. 5A, a more accurate result may be achievable. In other words, when the apparatus of 
Fig. 1 A or 2 is used to measure the spectra associated with the diffracting structure and 
any associated films (block 120 in Fig. 5 A), the radiation parameter tan\|/ is measured 
over a range of wavelengths, and such spectrum is then compared (block 122) to tan\y 
generated by the diffraction solver 108 in the flowchart of Fig. 5A, to arrive at a more 
accurate set of values for the final set 260 of Fig. 6B. 

Fig. 6C illustrates four of the radiation parameters that may be used. A more 
complete list includes the following 12 radiation parameters: 

R s ,R p ,R s -i? p ,cosA, lzny/=\r p /r s l> 

R s /R p ,X =\r s ~r p \\Y =\r s +r p \\X /F,(X -Y)/(X +Y) 

_ R p cos 2 A- R s sin 2 A 
R n cos 2 A + i? sin 2 A ' 

iJR cos 2 aJr sin 2 A 

0 = V p cos A 

R cos A + R K sin A 

where r 5 and r p denote the complex amplitude reflection coefficients for S and P 

polarizations respectively, while R s and R p are the reflectivities for S and P polarizations 

respectively: R s =\ r s | 2 , R p =| r p | 2 . The angle A is the analyzer angle and can be 

(optimally) set by hardware configuration. The quantities tan \jr and cos A are 
ellipsometric parameters known to those in the art. 

It will be noted that the process described in reference to Figs. 5 A, 5B takes into 
account both profile and film parameters, so that the process described above in reference 
to Fig. 6C selects the radiation parameter and its associated spectra that is the most 
sensitive to a variation in a profile and/or film parameter. 

The advantages provided by different aspects of the process described above are 
set forth in the table below. 
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Figs. 6A, 6B, 6C. Off-line techniques developed and 
used in this invention that allow real-time measurement 
of profile and film stack parameters by the method of 
Figs. 5A, SB 



Method 


Advantage 


Analysis of manufacturing 
process inTormauon oy a 
lithography simulation tool 
(Fig.6A). 


Optimal choice of profile model 
and process window for 
parameters 


Selection of one or more 
signals (spectra) that are most 
sensitive to parameters of 
interest (Fig.6C) 


Abilitv to measure both Drofile 
and film parameters by 
selecting most sensitive signals 


Generation of a look-up table of 
eigenvalues in the grating 
region 


Replacement of the eigenvalue 
computation by interpolation to 
speed up the diffraction solver 
in the real-time measurement 


Generation of a coarse library 
of spectra within the process 
window (Rg.6B) 


Best initial seed to accelerate 
the convergence of nonlinear 
optimization 



Fig. 7 is a block diagram of an integrated spectroscopic diffraction-based 
metrology system , a photolithographic track/stepper and an etcher to illustrate another 
aspect of the invention. A layer of material such as photoresist is formed on the surface 
of a semiconductor wafer by means of track/stepper 350, where the photoresist forms a 
grating structure on the wafer. One or more of the CD, H, SWA and/or other parameters 
of the grating structure are then measured using systems 10, 80 of Fig. 1 A, 2 and one or 
more of the above-described techniques may be employed if desired to find the value(s) 
of the one or more parameters of the photoresist pattern and its associated film(s). Such 
value(s) from the computer 40 are then fed back to the track/stepper 350, where such 
information may be used to alter the lithographic process in track/stepper 350 to correct 
any errors. In semiconductor processing, after a layer of photoresist has been formed on 
the wafer, an etching process may be performed, such as by means of etcher 360. The 
layer of photoresist is then removed in a manner known in the art and the resulting 

grating structure made of semiconductor material on the wafer may again be measured if 
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desired using system 10 or 80, The value(s) measured using any one or more of the 
above-described techniques may be supplied to the etcher for altering any one of the 
etching parameters in order to correct any errors that have been found using system 10 or 
80. Of course, the results obtained by one or more of the above described techniques in 

5 system 10, 80 may be used in both the track/stepper and the etcher, or in either the 
track/stepper or the etcher but not both. The track/stepper 350 and/or etcher 360 may 
form an integrated single tool with the system 10 or 80 for finding the one or more 
parameters of a diffracting structure, or may be separate instruments from it. 

Fig. 8 is a schematic view of the track/stepper 350 and an associated flowchart 

10 illustrating a process for semiconductor wafer processing to illustrate in more detail the 
points of integration of the processing process with the detection of profiles of diffracting 
structures and associated films to illustrate in more detail a part of the process in Fig. 7. 
As shown in Fig. 8, a semiconductor wafer 352 may be loaded from a cassette loader 354 
to several stations labeled "prime," "coat," "soft bake " "EBR." Then the wafer 352 is 

15 delivered by a stepper interface 356 to exposure tool 358. The different processes at the 
four locations mentioned above are set forth below: 

At the location "Prime", the wafer undergoes chemical treatment before a layer of 
photoresist is spun on it, so that the photoresist layer can stick to wafer. At the location 
"Coat", a layer of photoresist coating is spun onto the wafer. At "Soft bake", the layer of 

20 resist is baked to remove chemical solvent from the resist. At "EBR" which stands for 
"edge-bead removal", a solvent nozzle or laser is used to remove excess photoresist from 
the edge of wafer. 

After the wafer has been exposed to radiation by tool 358, the wafer then 
undergoes four additional processes: "PEB," "PEB chill " "Develop " and "Hard bake." 

25 At "PEB or post exposure bake", the wafer is baked to reduce standing-wave effect from 
the exposure tool. Then it is cooled at "PEB chill". The wafer is then washed with 
reagent to develop the photoresist, so that un-exposed (negative) or exposed (positive) 
photoresist is removed. The wafer then is baked at "Hard bake" to stabilize the 
photoresist pattern. It will be noted that all of the components of device 350 of Fig. 8 

30 except for the "exposure tool" 358 is known as the "Track" (also called cluster). 

After these latter four processes have been completed, the wafer 352 is then 

returned to the cassette loader 354 and this completes the processing involving the 

stepper 350. The detection system 10 or 80 may be applied at arrow 362 to measure the 
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parameters of the diffracting structure and associated film(s). Thus, such parameters may 
be measured after "hard bake." 

Software Upgrades 

5 The invention has been described above, employing a system such as that shown 

in Fig. 1A and 2. While the various optical components in the system of Fig. 1 A and 2 
are used to obtain measured data from the sample, many of the other processes are 
performed by computer 40 (not shown in Fig. 2 to simplify the figure). Thus, for many 
systems currently being used by manufacturers such as semiconductor manufacturers, the 

10 computers used in the systems may not have the capability to perform the techniques 
described above. Thus, another aspect of the invention envisions that the software in 
these computers can be upgraded so that computer 40 can perform one or more of the 
above described different functions. Therefore, another aspect of the invention involves 
the software components that are loaded to computer 40 to perform the above-described 

15 functions. These functions, in conjunction with the optical components of system 10 or 
80 in Fig. 1A or 2, provide results with the different advantages outlined above. The 
software or program components may be installed in computer 40 in a variety of ways. 

As will be understood in the art, the inventive software components may be 
embodied in a fixed media program component containing logic instructions and/or data 

20 that when loaded into an appropriately configured computing device to cause that device 
to perform according to the invention. As will be understood in the art, a fixed media 
program may be delivered to a user on a fixed media for loading in a users computer or a 
fixed media program can reside on a remote server that a user accesses through a 
communication medium in order to download a program component. Thus another 

25 aspect of the invention involves transmitting, or causing to be transmitted, the program 
component to a user where the component, when downloaded into the user's device, can 
perform any one or more of the functions described above. 

Fig. 9 shows an information appliance (or digital device) that may be understood 
as a logical apparatus that can read instructions from media 417 and/or network port 419. 

30 Apparatus 40 can thereafter use those instructions to direct server or client logic, as 
understood in the art, to embody aspects of the invention. One type of logical apparatus 
that may embody the invention is a computer system as illustrated in 40, containing CPU 
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404, optional input devices 409 and 411, disk drives 415 and optional monitor 405. 
Fixed media 417 may be used to program such a system and may represent a disk-type 
optical or magnetic media, magnetic tape, solid state memory, etc.. One or more aspects 
of the invention may be embodied in whole or in part as software recorded on this fixed 
media. Communication port 419 may also be used to initially receive instructions that are 
used to program such a system to perform any one or more of the above-described 
functions and may represent any type of communication connection, such as to the 
internet or any other computer network. The instructions or program may be transmitted 
directly to a user's device or be placed on a network, such as a website of the internet to 
be accessible through a user's device. All such methods of making the program or 
software component available to users are known to those in the art and will not be 
described here. 

The invention also may be embodied in whole or in part within the circuitry of an 
application specific integrated circuit (ASIC) or a programmable logic device (PLD). In 
such a case, the invention may be embodied in a computer understandable descriptor 
language which may be used to create an ASIC or PLD that operates as herein described. 

While the invention has been described above by reference to various 
embodiments, it will be understood that changes and modifications may be made without 
departing from the scope of the invention, which is to be defined only by the appended 
claims and their equivalents. All references referred to herein are incorporated by 
reference in their entireties. 
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